Added Error Logging #164

ParamThakkar123 · 2025-09-12T13:09:14Z

Fixes #74

ParamThakkar123 · 2025-09-12T13:33:43Z

PaliC

Thanks for the PR!

This is in the right direction; however, we have a colleague who's using this code to run experiments (@shaahins please take a look), so I want to be careful about touching this code.

Generally I have two points of feedback!

For this toy agent, there is a distinction between 1) errors in the LLM output (mostly that we can't pull a kernel from it) and 2) our API provider doing something weird like rate limiting. When putting feedback back into the LLM, we only care about the first error class (so Agent Error should only cover the first case), as the second is not actionable by the agent (i.e., Claude can't do anything about us not paying our Anthropic bill lol). It is somewhat useful to have an error class for 2) when we run experiments, but it is not necessary, so I wouldn't include it in this PR.

The second point of feedback is that there is a lot of redundancy in your code. Generally, if we can't pull a kernel out of the code (in this iteration of the agent), that's enough to say "ohh something is wrong here".
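To make the split between the two error classes concrete, here is a minimal sketch, not the code in this PR. `AgentError` is the name used in the PR; `LLMConnectionError`, `extract_kernel`, and `feedback_for_agent` are hypothetical names chosen for illustration, and the fenced-block regex is an assumption about what "pulling a kernel" looks like.

```python
import re
from typing import Optional


class AgentError(Exception):
    """The LLM answered, but the answer is unusable (e.g. no kernel in it)."""


class LLMConnectionError(Exception):
    """Hypothetical name: the provider was unreachable or rate limited us."""


# Build the fence marker programmatically so this example stays readable.
_FENCE = "`" * 3
_CODE_BLOCK = re.compile(_FENCE + r"(?:python)?\s*\n(.*?)" + _FENCE, re.DOTALL)


def extract_kernel(llm_output: str) -> str:
    """Return the first fenced code block, or raise AgentError if none exists."""
    match = _CODE_BLOCK.search(llm_output)
    if match is None or not match.group(1).strip():
        # One check is enough: no kernel means the agent's output is wrong.
        raise AgentError("no kernel code block found in the LLM output")
    return match.group(1).strip()


def feedback_for_agent(exc: Exception) -> Optional[str]:
    """Only errors the agent can act on are turned into feedback text."""
    if isinstance(exc, AgentError):
        return f"Agent error: {exc}"
    # Connection problems are not actionable by the agent, so no feedback.
    return None
```

With this shape, a single "could not extract a kernel" check covers the first error class, and provider failures never reach the feedback string.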

BackendBench/backends/kernel_agent.py

BackendBench/backends/llm.py

BackendBench/llm_client.py

PaliC · 2025-09-12T17:40:24Z

Also @ParamThakkar123 run pytest to make sure things work.

ParamThakkar123 · 2025-09-12T17:41:39Z

Sure @PaliC. I am using pytest for testing, and I have noted all your feedback and suggestions. I will make sure all code changes are aligned with your feedback and that all of them work. Thank you so much!

ParamThakkar123 · 2025-09-13T17:32:06Z

@PaliC I incorporated all your suggestions in this code and pushed it. Ran the formatter and tested it with pytest.

PaliC

Sorry for the late review. This is almost there, just a few more changes :)

BackendBench/llm_client.py

PaliC · 2025-09-17T16:55:45Z

BackendBench/backends/kernel_agent.py

 import os
 from typing import Callable, Dict

+from BackendBench.agent_errors import AgentError


Please revert the changes to kernel_agent.py. I'm not really sure of the status of this / how it interacts with KernelFalcon.

BackendBench/agent_errors.py

PaliC · 2025-09-17T23:44:19Z

Also, apologies for this, but there was another major refactor to this code. I'd recommend rebasing!

ParamThakkar123 · 2025-09-18T13:26:46Z

Sure @PaliC

ParamThakkar123 · 2025-09-18T14:06:17Z

@PaliC All changes noted and implemented. Can you please review?

PaliC

Sorry for the late review (currently traveling).

You'd want to adapt to the FeedbackInfo change (it's a struct and not a dict now). You can just add AgentError as part of the struct, similar to CompilerError.

Also, please do not categorize things as an AgentError if we cannot access the LLM (i.e., rate limits, bad connection, etc.).
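As a rough illustration of the struct-based shape, the sketch below uses a dataclass. It is an illustrative stand-in only: the real `FeedbackInfo` in BackendBench has its own fields, and the field names `compilation_error` and `agent_error` as well as the `has_error` helper are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FeedbackInfo:
    """Illustrative stand-in; not the actual BackendBench definition."""

    # Assumed name for the existing compiler-error slot.
    compilation_error: Optional[str] = None
    # New slot: set only when the LLM output itself is unusable.
    agent_error: Optional[str] = None

    @property
    def has_error(self) -> bool:
        """True if any error slot is populated."""
        return self.compilation_error is not None or self.agent_error is not None


# Attribute access replaces the old dict-style lookup.
info = FeedbackInfo(agent_error="no kernel code block found")
print(info.agent_error, info.has_error)
```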

BackendBench/backends/llm.py

PaliC · 2025-09-21T01:06:51Z

BackendBench/llm_client.py

+                )
+            response_data = response.json()
+            content = response_data.get("output", "")
+            if not content or "rate limit" in content.lower():


As discussed before, these should not be agent errors, as these are issues with connecting to the server.
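One way to read this comment is sketched below, under assumptions: the same empty-or-rate-limited check from the diff is kept, but it raises a connection-style error instead of an agent error. `LLMConnectionError` and `read_output` are hypothetical names, and the `"output"` key is taken from the snippet above.

```python
class LLMConnectionError(Exception):
    """Hypothetical name for failures to get a usable reply from the server."""


def read_output(response_data: dict) -> str:
    """Return the model output, or raise if the server did not really answer."""
    content = response_data.get("output", "")
    if not content or "rate limit" in content.lower():
        # Server-side problem: not something the agent can fix, so this is
        # deliberately not an AgentError.
        raise LLMConnectionError("empty or rate-limited response from the server")
    return content
```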

ParamThakkar123 · 2025-09-21T18:41:36Z

@PaliC Updated the error logging functionality as suggested

PaliC

Would recommend taking another look, but almost there!

BackendBench/llm_client.py

BackendBench/backends/llm.py

ParamThakkar123 · 2025-09-22T03:19:04Z

@PaliC Made the changes

Added Error Logging

eac24fe

meta-cla bot added the CLA Signed label Sep 12, 2025

Updates

25d090a

PaliC requested changes Sep 12, 2025

ParamThakkar123 added 3 commits September 13, 2025 22:19

rebase and agent_error fix

31dbe75

Updates

01426d2

Updates and Feedback noted

f2aad73

ParamThakkar123 requested a review from PaliC September 14, 2025 02:40

Merge branch 'main' of https://github.com/meta-pytorch/BackendBench i…

102a8cd

…nto AgentErrors

PaliC requested changes Sep 17, 2025

ParamThakkar123 and others added 2 commits September 18, 2025 10:01

Merge branch 'main' into AgentErrors

48400a5

Rename done, ConnectionError Defined and reverted changes from kernel…

e2f1c68

…_agent.py

ParamThakkar123 requested a review from PaliC September 19, 2025 07:10

PaliC requested changes Sep 21, 2025

ParamThakkar123 added 2 commits September 22, 2025 00:04

Merge branch 'main' of https://github.com/meta-pytorch/BackendBench i…

796d015

…nto AgentErrors

Updates

c172bf9

PaliC requested changes Sep 22, 2025

Updates

17f4446

ParamThakkar123 requested a review from PaliC September 22, 2025 03:18

Merge branch 'main' of https://github.com/meta-pytorch/BackendBench i…

a49c9fd

…nto AgentErrors
